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(57) Abstract 

The invention relates to cDNA molecules coding for eukaryotic topoisomerase I polypeptide. The invention pro- 
vides cDNA molecules comprising a part of the cDNA sequence of topoisomerase I which encode at least one epitope for 
autoantibodies to eukaryotic topoisomerase I. The invention also provides cloning vehicles capable of replication and ex- 
pression comprising cDNA molecules coding for eukaryotic topoisomerase I. The invention further provides for hosts 
transformed with a vehicle having a cDNA molecule coding for eukaryotic topoisomerase I. In another embodiment, the 
invention provides for the detection of autoantibodies to eukaryotic topoisomerase I using the eukaryotic topoisomerase I 
polypeptides coded for by the cDNA molecules of the invention. 



FOR THE PURPOSES OF INFORMATION ONLY 



Codes used to identify States party to the PCT on the front pages of pamphlets publishing international appli- 
cations under the PCT. 



AT 




FR 




ML 


Mali 


AU 


Australia 


GA 




MR 


Mauritania 


BB 


Barbados 


GB 


United Kingdom 


MW 


Malawi 


BE 




HTJ 


Hungary- 


NL 


Netherlands 


BG 


Bulgaria 


IT 


Italy 


NO 


Norway 


BJ 




JP 




RO 


Romania 


BR 


Brazil 


KP 


Democratic People's Republic 


SD 




CF 


Central African Republic 




of Korea* 


SE 


Sweden 


CG 




KR 


Republic of Korea 


SN 


Senegal 


CH 


Switzerland 




Liechtenstein. 


SU 


Soviet Union 


CM 


Cameroon 


LK 


Sri Laaka 


TD 


Chad 


DE 


Germany, Federal Republic of 


LU 


Luxembourg 


TG 


Togo 






MO 


Monaco 


IIS 


Hmtftd Statu nf Am 



WO 89/09222 



PCT/US89/01116 



-1~ 



CLONED HUMAN TOPOISOMERASE I 

BACKGROUND OF THE INVENTION 

This work was supported a Grant from the National Institutes 
of Health. The United States Government may retain certain rights 
in this invention. 
FIELD OF THE INVENTION 

The present invention relates to the use of reeombinant DNA 
technology for the transformation of a host organism with eukaryotic 
genetic sequences for the expression of topoisomerase I polypeptides 
and methods of using topoisomerase I polypeptides to detect auto- 
antibodies to topoisomerase I polypeptides. 
DESCRIPTION OF THE BACKGROUND ART 

DNA topoisomerases I and n catalyze the breaking and rejoin- 
ing of DNA strands in a way that allows the strands to pass through 
one another, thus altering the topology of DNA. Type I topoisom- 
erases break a single DNA strand, while the type n enzymes break two 
strands of duplex DNA. Both enzymes can perform a variety of simi- 
lar topological interconversions, including relaxation of super coiled 
DNA, knotting/unknotting and catenation/decatenation of duplex 
DNA. 

Both topoisomerases 1 and II can provide the topological 
interconversions necessary for transcription and replication. For 
example, topoisomerase I can provide the necessary unlinking activity 
for efficient in vitro DNA replication (Minden, et al., Journal of Bio- 
logical Chemistry , 260:9316, 1985; Nagata, et al., Proceedings of the 
National Academy of Sciences, U.S.A. . 80:4266, 1983; Yang, et al., 
Proceedings of the National Academy of Sciences, U.S.A. . 84:950, 
1987) however, topoisomerase n can also facilitate the replication of 
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SV40 DNA by HeLa cell lysates (Yang, et aL, Proceedings of__ the 
National Academy of Sciences, U.S.A. , 84:950, 1987). Genetic studies 
in yeast reveal that both replication and transcription proceed in 
single mutants deficient in either tapoisomerase I or n (Goto, et al. 
Proceedings of the National Academy of Sciences. U.S.A. , 82:7178, 
1985; Uemura, et al., EMBO Journal . 3:1737, 1984; Thrash, et al.. Pro- 
ceedings of the National Academy of Sciences. U.S.A. . 82:4274, 1985). 
In cells lacking both topoisomerases, transcription and replication are 
dramatically reduced (Uemura, et al., EMBO Journal , 5:1003, 1986; 
Brill, et al., Nature , 326:414, 1987). 

Several lines of evidence suggest that topoisornerase I normally 
functions during transcription. The enzyme has been shown to be 
localized preferentially to actively transcribed loci by immunofluor- 
escence (Fleishmann, et al., Proceedings of the National Academy of 
Sciences, U.S.A. , 81:6958, 1984), and by co-immunoprecipitation with 
transcribed DNA (GUnSrk et at, Cell . 44:401, 1986; Muller, et al., 
EMBO Journal , 1237, 1985)>-Furthermore, topoisornerase I cleavage 
sites have been mapped to regions in and around transcribed DNA 
(Bonner, et al., Cell, 41:541, 1985; Gilmour, et al., Molecular Cell Biol- 
ogy , 7:141, 1987; Stewart, et al., Cell , 50:109, 1987). Nonetheless, at 
least in yeast, topoisornerase n can apparently substitute for the 
functions of topoisornerase I in transcription (Uemura, et ai. T EMBO 
Journal , 3:1737, 1984; Thrash, et al., ibid). 

Of further interest is the use of topoisornerase I in classifying 
autoimmune disease. Autoimmune diseases are diseases in which an 
animal's immune system attacks its own tissues. Often the various 
types of autoimmune disease can be characterized based upon the 
specificity of autoantibodies which are produced. For example, it is 
well known that the serum of patients having the connective tissue 
autoimmune disease progressive systemic sclerosis (PSS), also known 
as scleroderma, frequently contain antibodies to such nuclear antigens 
as topoisornerase I. Thus, the ability to accurately- detect the 
presence of antibodies reactive with topoisornerase I can greatly 
assist in evaluating the prognosis and planning, or monitoring, the 
appropriate therapy of patients with scleroderma. 
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Unfortunately, the existing commercial dectection systems 
utilize crude exacts of cell nuclei, usually of rabbit or bovine origin, 
which contain antigens other than topoisomerase I. As a result, these 
systems will detect antibodies which react with nuclear antigens 
other than topoisomerase I such as, for example, the centromere, and, 
as a result, may give false positive results which can lead to an incor- 
rect diagnosis. Thus, there is considerable need for a system which 
detects only antibodies to topoisomerase I and not other eukaryotic 
peptides. 

SUMMARY OF THE INVENTION 

It is a major object of the present invention to provide a DNA 
sequence which encodes eukaryotic topoisomerase I polypeptide. 

Another object of the present invention is to provide a 
prokaryotic or eukaryotic host cell transformed or transfected with a 
DNA sequence which encodes eukaryotic topoisomerase I polypeptide 
such that the host cell can express this polypeptide. 

Yet another object of the present invention is to provide a 
biologically functional plasmid or viral DNA vector which includes a 
DNA sequence encoding eukaryotic topoisomerase I polypeptide. 

It is Still another object of the present invention to provide a 
DNA sequence consisting essentially of a DNA sequence encoding a 
polypeptide which has an amino acid sequence possessing at least one 
epitope for binding topoisomerase I autoantibodies. 

It is an object of the invention to provide a method of 
producing eukaryotic topoisomerase I polypeptide in host cells. 

Another object of the present invention is to provide a method 
of detecting autoantibodies to eukaryotic topoisomerase I. 

These and other objects are provided by one or more of the 
following embodiments of the present invention. In one embodiment 
a DNA sequence is provided which encodes eukaryotic topoisomerase I 
polypeptide. 

In another embodiment a biologically functional plasmid or 
viral DNA vector is provided which includes a DNA sequence 
encoding for eukaryotic topoisomerase I polypeptide. 
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In another embodiment, a method of producing eukaryotie to- 
poisomerase I polypeptide is presented which comprises providing a 
host cell which can replicate and express an intron-free DNA 
sequence of eukaryotie topoisomerase I polypeptide, growing the host 
cell, and recovering the eukaryotie topoisomerase I polypeptide. Also 
contemplated by the present invention are the transformed or 
transfected host cells themselves. 

In yet another embodiment eukaryotie topoisomerase I poly- 
peptide is provided which is substantially free of other eukaryotie 
non-topoisomerase 1 polypeptides. 

In still another embodiment, a method of detecting autoanti- 
bodies to eukaryotie topoisomerase I polypeptide is provided which 
comprises contacting a sample with the polypeptide expressed by the 
DNA sequence of the invention, incubating the sample with the 
polypeptide for a period of time and under conditions sufficient for 
the autoantibodies to bind to the polypeptide, separating the auto- 
antibodies which are bound to the polypeptide from the rest of the 
sample, and detecting the presence of the autoantibodies which bind 
to the polypeptide. 

A major advantage of the present invention is that it provides 
the art with a ready source of eukaryotie topoisomerase I polypeptide 
corresponding to that purified from natural sources, while avoiding 
the problems associated with the isolation of naturally occurring eu- 
karyotie topoisomerase I polypeptide to separate it from other eukar- 
yotie non-topoisomerase I polypeptides. This absence of other eukar- 
yotie non-topoisomerase 1 polypeptides is significant in that it allows 
the development of test systems which will only detect antibodies 
specif ically reactive with topoisomerase I polypeptides. 

Another advantage of providing eukaryotie topoisomerase I 
polypeptide in host cells is that by so doing it is possible to obtain 
much larger quantities of the polypeptide than are currently practica- 
bly available from natural sources. As a consequence, not only is it 
possible to use the polypeptide of the invention to more accurately 
classify patients with such autoimmune rheumatic diseases as 
scleroderma, but it is also now possible to also provide commercially 
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useful quantities of topoisomerase I polypeptide for use in diagnostic 
systems. 

The eukaryotic topoisamerase I polypeptides of the present 
invention retain the ability to bind autoantibodies to eukaryotic topo- 
isomerase I even though the prokaryotic host in which the 
polypeptides of the invention can be cloned degrades the transcribed 
topoisomerase I molecules into a spectrum of polypeptides. 
DESCRIPTION OF THE DRAWINGS 

Figure 1. Restriction map and strategy for sequencing topo- 
isomerase I eDNA clones T1A and TIB. The bold lines represent the 
cDNA coding sequences and the thin lines the presumptive 5' and 3' 
untranslated regions. A partial restriction endonuclease map is shown 
(X, Xho I; M, Xmn I; N, Nde I; H, Hind III; R, ECO RV; E f Eco RI). The 
arrows represent the individual subclones that were sequenced to 
determine the complete cDNA sequence. The subclone indicated by 
the asterisk was used to probe the DNA blot shown in figure 4. 

Figure 2. Expression of fp-topo I and comparison of the 
proteolysis maps of human nuclear topoisomerase I and fp-topo I. A) 
Immunoblots of lysates of bacteria harboring pATHIl (-) or 
pATHll-TIA(F) probed with an ti- topoisomerase 1 serum. B) Proteoly- 
sis maps. Human topoisomerase I <T) and fp-topo I (F) obtained by 
excision from NaDodS04 polyacrylamide gel. Lanes labeled "-" were 
loaded with gel slices (between 95 and 116kDa) excised from 
SDS-PAGE gels loaded with bacterial lysates that contained the 
pATHIl expression vector without the T1A cDNA. V8 protease or 
chymotrypsin were overlaid so that proteolysis could proceed during 
subsequent electrophoresis. An immunoblot probed with anti-topo- 
isomerase I autoimmune serum is shown. Molecular weight markers: 
200, 116, 95, 68, SO, 48, 40, 29, 18.4, 16.9 kDa. 

Figure 3. A) Topoisomerase I activity in lysates of bacteria 
harboring pATHIl or pATHll-TlA. Lysates of bacteria harboring 
pATHIl (-) or pATHll-TlA(+) were added to reaction mixtures con- 
taining either buffer P (with free Mg++) or buffer E (without free 
Mg++). B) Immunodepletion of Mg++-independent topoisomerase I 
activity from bacterial lysates. Lysates of bacteria harboring 
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pATHll-TIA were .immunoprecipitated with either normal human 
(NH) or with anti-topoisornerase I (alpha -topo) serum. The superna- 
tant (lane 1) and serial two-fold dilutions (lanes 2 and 3) were then 
assayed for topoisomerase I activity in the absence of free Mg++. 

Figure 4. Human topoisomerase I is encoded by a 4.1 kb mRNA 
transcribed from a single copy human gene. A) Human genomic DNA 
cut with EcoR I (R) or Hind III (H) was probed with the fragment from 
T1A which extends from codon 484 to 578 (fragment with asterisk in 
Figure 1, also see Figure 6). DNA size markers: 23130, 9416, 6557, 
4361, 2322, 2027, 1353 bp. B) Polyadenylated mRNA from HeLa cells 
probed with the entirety of T1A. RNA markers: 9490, 7460, 4400, 
2370, 1350 nucleotides. 

Figure 5. Sequence of topoisomerase I cDNA. The sequence 
was performed according to the strategy outlined in Figure 1. The 
sequence includes the presumptive 5' and 3' untranslated sequences of 
211 and 1139 bases, respectively. 

Figure 6 . Comparison of the amino acid sequences of human 
(bottom) and Saccharomyces cerevisiae (top) topoisomerase I as 
deduced from their DNA sequences. A) Double dots represent amino 
acid identities between S, cerevisiae and human type i topoisom- 
erases. Single dots represent similarity in sequence due to conserva- 
tive changes. Asterisks denote a large gap placed in the human 
sequence in order to maintain maximal alignment. B) Schematic dia- 
gram of regions of greatest similarity between human and S. 
cerevisiae amino acid sequences. Thin lines represent regions without 
apparent similarity while the heavy bars represent regions of signifi- 
cant similarity. . Regions shown as heavy bars show 55% identity or 
69% similarity when conservative changes are considered; 
DETAILED DESCRIPTION 

The present invention relates to the manipulation of genetic 
materials by recombinant procedures which make possible the produc- 
tion of polypeptides possessing part or all of the primary structural 
conformation for one or more of the epitopes for binding auto- 
antibodies to eukaryotic topoisomerase I. These polypeptides are 
highly useful for the immunological detection of autoantibodies 
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reactive with them since such autoantibodies are indicative of auto- 
immune rheumatic diseases. 

The cDNA clones produced by the present invention encode 
human topoisomerase I is demonstrated by the fact that: (1) partial 
proteolysis maps of the cloned fusion protein and naturally-occurring 
nuclear topoisomerase I are virtually identical; (2) the 67,700 Dalton 
carboxy-terminal portion of the cloned protein exhibits topoisom- 
erase I activity in vitro ; and (3) the cloned cDNA shows a high degree 
of sequence similarity with topoisomerase I from yeast (Thrash, et ah, 
ibid ). The hybridization data obtained indicate that human cells con- 
tain only a single locus complementary to this cDNA and that this 
locus gives rise to a single mRNA. 

Topoisomerase I shows a significant degree of evolutionary 
amino acid sequence conservation (42%) between man and yeast. A 
comparison of the polypeptide sequences of human and yeast enzymes 
(Figure 6A) reveals two regions of relatively stronger sequence simi- 
larity which are designated region I (435 residues, starting at 197 in 
the humans) and region n (the carboxy-terminal 70 residues). At opti- 
mal alignment both enzymes share a 54% amino acid sequence iden- 
tity which increases to 69% when conservative substitutions are 
included. A schematic which indicates the relative locations and 
extent of regions I and n is shown in Figure 6B. 

Interestingly, human and yeast topoisomerase I are predicted to 
be nearly the same molecular weight, despite the presence of approxi- 
mately 70 additional residues in the yeast sequence between regions I 
and n. These additional sequences in the yeast enzyme are balanced 
by the fact that in the human sequence 70 additional residues are pre- 
sent at the amino terminus. 

The amino terminal region of the human protein is remarkable 
for its sequence composition wherein 65% of the residues are charged 
based on the fact that of the 197 residues present in the non-similar 
region between yeast and human, 80 are basic and 48 are acidic. 
Remarkably, the entire protein has 43% charged residues overall. 

The cloned human topoisomerase I was found to have no 
significant sequence similarities with E. coli type 1 topoisomerase I. 
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Although, it has been previously shown that topoisomerase I from 
yeast and E. coli share a small region of sequence similarity, this 
region was not found in the human sequence for topoisomerase I. 

Studies on the topoisomerase I of the invention clearly indicate 
that the catalytic site is present in the carboxy-terminal 68 kDa 
region of the polypeptide. The highly charged amino terminal portion 
of the polypeptide may modulate the activity of the catalytic moiety 
or, alternatively, be involved in protein-protein interactions. Possible 
targets for such interactions include regulatory proteins, components 
of -the transcriptional apparatus, or perhaps histones. 

In the present invention cDNA clones encoding human topo- 
isomerase I were obtained from a library of human placental mRNA 
using the lambda gt 11 expression method (Young, et al.. Science . 
222:778, 1983). Clones expressing human topoisomerase I were 
screened by immuna-probing with an anti-topoisomerase I auto- 
immune serum. 

The development of specific DNA sequences encoding topo- 
isomerase I polypeptide for splicing into DNA vectors can be accom- 
plished using a variety of techniques. For example, alternative meth- 
ods which can be employed include: (1) the isolation of a double- 
stranded DNA sequence from the genomic DNA of the eukaryote; 
(2) the chemical manufacture of a DNA sequence to provide the nec- 
essary codons for the polypeptide of interest; and (3) the in vitro syn- 
thesis of a double-stranded DNA sequence by reverse transcription of 
mRNA isolated from a eukaryotic donor cell. In the latter case, a 
DNA complement of mRNA is formed which is generally referred to 
as cDNA. 

The manufacture of DNA sequences is frequently the method 
of choice when the entire sequence of amino acid residues of the 
desired polypeptide product is known. When the entire sequence of 
amino acid residues of the desired polypeptide is not known, the 
direct manufacture of DNA sequences is not possible and the method 
of choice is the formation of cDNA sequences. Among the standard 
procedures for isolating cDNA sequences of interest is the formation 
of plasmid-carrying cDNA libraries which are derived from reverse 
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transcription of mRNA which is abundant in donor cells which have a 
high level of genetic expression. In those cases where significant 
portions of the polypeptide's amino acid sequence are known, the pro- 
duction of labeled single stranded DNA probe sequences duplicating a 
sequence putatively present in the target cDNA may be employed in 
DNA/DNA hybridization procedures which are carried out on cloned 
copies of the cDNA which have been denatured into a single stranded 
form (Jaye, et al ., Nucleic Acid Research . .11:2325, 1983). 

Hybridization procedures are useful for the screening of 
recombinant clones by using labeled mixed synthetic oligonucleotide 
probes wherein each is potentially the complete complement of a 
specific DNA sequence in the hybridization sample which includes a 
heterogeneous mixture of single stranded DNAs or RNAs. These pro- 
cedures are particularly useful in the detection of cDNA clones 
derived from sources where extremely low amount of mRNA 
sequences relating to the polypeptide of interest are present. In other 
words, by using stringent hybridization conditions directed toward 
avoidance of non-specific binding it is* possible, for example, to allow 
the autoradiographic visualization of a specific cDNA clone by the 
hybridization of the target DNA to that single probe in the mixure 
which is its complete complement (Wallace, et al .. Nucleic Acid 
Research . 9:879, 1981). 

Of the three above-noted methods for developing specific DNA 
sequences for use in recombinant procedures the use of genomic DNA 
isolates is the least common. This is especially true when it is desir- 
able to obtain the microbial expression of mammalian polypeptides. 

The present invention provides novel polypeptides of eukar- 
yotic topoisomerase I which have part or all of the primary structural 
confirmation, that is, a continuous sequence of amino acid residues, 
having at least one epitope for autoantibodies to eukaryotic topoisom- 
erase I. 

When the eukaryotic topoisomerase I encoded by the DNA 
sequence of the invention is expressed in a prokaryotic host it is 
almost immediately degraded into a spectrum of polypeptide frag- 
ments. As a consequence, it was surprising the find that these 
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polypeptide f ragments retain at least one epitope capable of reacting 
with autoantibodies to eukaryotic topoisamerase I. Thus, it is possible 
to use the polypeptide fragments of the invention rather than intact 
eukaryotic topoisomerase I to detect autoantibodies to eukaryotic to- 
poisomerase I. 

The term "polypeptide," as applied to eukaryotic topoisom- 
erase I polypeptide, denotes any sequence of amino acids having an 
epitope for autoantibodies to eukaryotic topoisomerase I, wherein the 
sequence of amino acids is encoded by all or part of the cDNA 
sequences of the invention. 

The polypeptides resulting from microbial expression of the 
DNA sequences of the invention can tie further characterized by their 
freedom from association with other human polypeptides or other 
contaminants which might otherwise be associated with eukaryotic 
topoisomerase I in its natural cellular environment or in such extra- 
cellular fluids as plasma or urine. 

The term "host" is meant to include not only prokaryotes, but 
also such eukaryotes as yeast, filamentous fungi, as well as plant and 
animal cells which can replicate and express an intron-free DNA 
sequence of eukaryotic topoisomerase I. However, prokaryotes are 
preferred as the host organism. 

The term "prokaryotes" is meant to include all bacteria which 
can be transformed or transf ected with, the eukaryotic gene for the 
expression of eukaryotic topoisomerase I. Procaryotie hosts may 
include gram negative as well as gram positive bacteria such as t for 
example, E. coli, S. tvnhimurium , Serratia marcescens and Bacillus 
subtilis . 

A recombinant DNA molecule coding for the eukaryotic topo- 
isomerase I polypeptides can be used to transform or transfect the 
host using any of the techniques commonly known to those of ordinary 
skill in the art. Especially preferred is the use of a plasmid or a virus 
containing the eukaryotic topoisomerase I coding sequence for pur- 
poses of prokaryotic transf ormation or transf ection, respectively. 

Methods for preparing fused, operably linked genes and 
expressing them in bacteria are well-known in the art (Maniatis, et 
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ah, Molecular Cloning; A Laboratory Manual . Cold Spring Harbor 
Laboratory, Cold Spring Harbor, N.Y., 1981). The genetic constructs 
and methods described therein can be utilized for expression of eukar- 
yotic topoisomerase 1 in prokaryotic hosts. 

In general, expression vectors containing promoter sequences 
which facilitate the efficient transcription of the inserted eukaryotic 
genetic sequence are used in connection with the host. The expres- 
sion vector typically contains an origin of replication, a promoter, 
and a terminator, as well as specific genes which are capable of pro- 
viding phenotypic selection of the transformed cells. The trans- 
formed prokaryotic hosts can be grown in fermentors and cultured 
according to techniques known in the art to achieve optimal cell 
growth. The polypeptides of the invention can then be isolated from 
the growth medium, cellular lysates, or cellular membrane fractions. 

The isolation and purification of the microbially expressed 
polypeptides of the invention may be by any conventional means such 
as, for example, preparative chromatographic separations and immu- 
nological separations such as those involving the use of monoclonal or 
polyclonal antibody. 

By having provided the sequence of amino acid residues of eu- 
karyotic topoisomerase I, the present invention provides for the man- 
ufacture of DNA sequences which code for the host expression of 
polypeptide analogs or derivatives of eukaryotic topoisomerase I 
which differ from naturally-occurring forms in terms of the identity 
or location of one or more amino acid residues and which share some 
or all of the epitopes of naturally-occurring polypeptide forms. 

The novel DNA sequences of the invention include all 
sequences useful in providing the expression in prokaryotic or eukar- 
yotic host cells of polypeptides which have at least a part of the pri- 
mary structural confirmation for one or more epitopes capable of 
reacting with autoantibodies to eukaryotic topoisomerase I which are 
comprehended by: (a) the DNA sequence as set forth in Figure 5 or its 
complementary strand; (b) DNA sequences which hybridize to DNA 
sequences defined in (a) or fragments thereof; and (c) DNA sequences 
which, but for the degeneracy of the genetic code, would hybridize to 
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DNA sequences defined in (a) and (b) above. Specifically compre- 
hended in (b) are genomic DNA sequences which encode allelic 
variant forms of eukaryotic topoisomerase I. Part (c) specifically 
comprehends the manufacture of DMA sequences which encode eukar- 
yotic topoisomerase I, eukaryotic topoisomerase I fragments and eu- 
karyotic topoisomerase I analogs wherein the DNA sequences thereof 
may incorporate codons which facilitate translation of mRNA in non- 
vertebrate hosts. 

Since the cDNA sequence of the invention encodes essentially 
the entire human topoisomerase I molecule it is now a matter of rou- 
tine to prepare, subclone and express smaller polypeptide fragments 
of cDNA from this or a corresponding cDNA sequence which would 
encode as few as one epitope for autoantibodies to human topoisom- 
erase I. The presence of such an epitope on a cloned polypeptide can 
then be confirmed using, for example, sera from a patient with 
autoantibodies to topoisomerase I. 

The eukaryotic topoisomerase I of the invention is particularly 
suited for use in immunoassays in which it can be utilized in liquid 
phase or bound to a solid phase, carrier. In addition, the eukaryotic to- 
poisomerase I used in these assays can be detectably labeled in various 
ways. 

Examples of immunoassays which can utilize eukaryotic topo- 
isomerase I of the invention are competitive and non-competitive 
immunoassays in either a direct or indirect format. Examples of such 
immunoassays are the radioimmunoassay (RIA) the sandwich (immuno- 
metric) assay and the Western blot assay. Detection of antibodies 
which bind to the. eukaryotic topoisomerase I of the invention can be 
done utilizing immunoassays which run in either the forward, reverse, 
or simultaneous modes, including immunohistochemical assays on 
physiological samples. The concentration of eukaryotic topo- 
isomerase I which is used will vary depending on the type of immuno- 
assay and nature of the detectable label which is used." However, 
regardless of the type of immunoassay which is used, the concen- 
tration of eukaryotic topoisomerase I utilized can be readily 
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determined by one of ordinary skill in the art using routine experi- 
mentation. 

The eukaryotic topoisomerase I of the invention can be bound 
to many different carriers and used to detect the presence of antibody 
specifically reactive with the polypeptide. Examples of well-known 
carriers include glass, polystyrene, polyvinyl chloride, polypropylene, 
polyethylene, polycarbonate, dextran, nylon, amylases, natural and 
modified celluloses, polyacryiamides, agaroses and magnetite. The 
nature of the carrier can be either soluble in insoluble for purposes of 
the invention. Those skilled in the art will know of other suitable 
carriers for binding eukaryotic topoisomerase I, or will be able to 
ascertain such, using routine experimentation. 

There are many different labels and methods of labeling known 
to those of ordinary skill in the art. Examples of the types of labels 
which can be used in the present invention include enzymes, 
radioisotopes, colloidal metals, fluorescent compounds, chemilumin- 
escent compounds, and bioluminescent compounds. 

For purposes of the invention, the antibody which binds to eu- 
karyotic topoisomerase I of the invention may be present in various 
biological fluids and tissues. Any sample containing a detectable 
amount of antibodies to eukaryotic topoisomerase I can be used. Nor- 
mally, a sample is a liquid such as urine, saliva, cerebrospinal fluid, 
blood, serum and the like, or a solid or semi-solid such as tissue, feces 
and the like. 

The above disclosure generally describes the present invention. 
A more complete understanding can be obtained by reference to the 
following specific examples which are provided herein for purposes of 
illusration only and are not intended to limit the scope of the 
invention. 

EXAMPLE 1 

CLONING AND EXPRESSION OF TOPOISOMERASE I 
A. Recombinant DNA Procedures. cDNA . clones were 
obtained using the lambda gtll expression method of Young and Davis 
( Science . 222:778, 1983). lxlO 6 recombinant phage (lambda gtll 
library of human placental polyadenosine mRNA; Clontech) were 
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plated and screened by immunoprobing with an autoimmune anti-to- 
poisomerase I serum (Shero, et al., Science , 231:737, 1986), using 
standard methods (Maniatis, et al., ibid ). DNA and RNA hybridizations 
were performed according 1 to standard procedures (Maniatis, et al., 
ibid ). Blots were hybridized at 42 °C in the presence of 50% 
formamide and washed at 50 °C with a solution of 15 mM NaCl, 1.5 
mM Na citrate and 0.1% NaDodS04 (Maniatis, et al., ibid). cDNA 
sequences were determined by constructing an exonuelease III dele- 
tion series of clone T1A as described by Henicoff ( Gene , 28:351, 1984). 
The deleted fragments were subcioned into M13 for sequencing by the 
dideoxynucleotide chain termination method (Sanger, et al., Proceed- 
ings of the National Academy of Sciences, U.S.A. , 74:5463, 1977). 
Other sequences were determined by cloning specific restriction frag- 
ments into MIS. 

B. Expression of Topoisomerase I as a Fusion Protein in 
Bacteria. The cloned cDNA T1A (see Figure 1) inserted into the 
pATHll expression vector (provided by T.J. Koerner, Duke Univer- 
sity) was expressed in E. coli (DH1) as a fusion protein comprised of 
32kDa of the bacterial trpE protein linked to 67.7 kDa of the protein 
encoded by the cDNA, as previously described (Earnshaw, et al, Jour- 
nal of Cell Biology . 104 . 817, 1987). This fusion protein (fp-topo I) was 
induced and visualized by immunoblotting as described (Earnshaw, et 
al., ibid ). All immunoblotting was done with the same anti-topoisom- 
erase I autoimmune serum. 

1 x 10® recombinant phage were screened with topoisomerase I 
antiserum (Shero, et al., ibid ). Eight immunoreactive plaques were 
purified and the longest cDNA clone (TlA, 2929 bp) was analyzed by 
restriction mapping and sequenced according to the strategy outlined 
in Figure 1. Longer eDNAs were obtained by screening additional 
recombinant phage with the 5" Eco RI - Nde I fragment of TlA as a 
hybridization probe, 10 additional clones were obtained including 
clone TIB (3645 bp, Figure l), which appears to contain the entire 
coding region (see below). Both TlA and TIB terminate at their 5 1 
ends with an Eco RI site (located in the 3 1 nontranslated region) which 
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was also found in several other clones obtained from both the antibody 
and DNA hybridization screens. 

Clone T1A was subcloned into a bacterial expression piasmid 
(pATHll, in Earnshaw, et ah, ibid ) in which protein encoded by the 
cDNA is linked in a fusion protein to the 32 kDa amino terminal por- 
tion of the bacterial trpE protein. Bacterial lysates harboring 
pATHll containing T1A (pATHll-TIA) contained several immunore- 
active polypeptides (probably proteolysis fragments) that were not 
evident in lysates harboring only pATHll (Figure 2A). 

EXAMPLE 2 

CHARACTERIZATION OF CLONED TOPOISOMERASE I 

A. Proteolysis Mapping. Proteolysis maps of human topo- 
isomerase I and fp-topo I were performed as described (Cleveland, 
Methods in Enzvmology . 96:222, 1983), except that immunoblotting 
was used to visualize the proteolysis fragments instead of Coomassie 
blue staining. Gel bands corresponding to human topoisomerase I and 
fp-topo I (the largest two bands) were excised in small strips, inserted 
into the wells of another NaDodS04 gel, and overlaid with either 
chymotrypsin (100 g) or V8 protease from Staphylococcus aureus (1.0 
or 10.0 g). Electrophoresis was initiated at 50 volts until the dye 
completely entered the stacker. Proteolysis occurs as the proteins 
stack (Cleveland, ibid ). 

To examine the biochemical similarity between the fusion 
protein (fp-topo I) and human nuclear topoisomerase I, their 
proteolysis maps were compared. HeLa nuclear topoisomerase I and 
fp-topo 1 (the largest two species seen in Figure 2A) were obtained by 
excision from NaDodS04 polyacrylamide gels. Human nuclear topo- 
isomerase I and fp-topo I gave rise to nearly identical patterns of 
immunoreactive proteolytic fragments, indicating that the two 
polypeptides are closely related (Figure 2B). The virtual identity of 
these proteolysis maps indicates that the immunoreactive proteolysis 
fragments of fp-topo I contain nondetectable amounts of trpE. 

B. DNA Topoisomerase I Assays. Assays for bacterial and 
human topoisomerase I were performed essentially as described by 
Thrash et al. (ibid). 
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To confirm that clone T1A encoded topoisomerase I, fp-topo I 
was assayed for Mg ++ -independent supercoil-relaxing activity, a 
characteristic of eukaryotic topoisomerase I (Thrash, et ah, ibid ). 
Bacteria expressing fp-topo I (lanes +), and nonexpressing control 
bacteria (harboring pATHll only, lanes -> were lysed, and after pellet- 
ing- debris, aliquots of the supernatants were tested for their ability to 
relax super coiled DNA (Figure 3A). In buffer P which contains 
excess Mg ++ , lysates of bacteria that were not expressing fp-topo I 
contained supercoil-relaxing activity due to bacterial topoisomerase I, 
which requires Mg ++ as a cof actor (~, lane P). In buffer E, which does 
not contain free Mg ++ , bacterial topoisomerase I activity is abolished 
(-.lane E). In lysates of bacteria expressing fp-topo I, supercoil-relax- 
ing activity is evident both in the presence and absence of free Mg ++ 
(+, lanes P and E). This Mg^-independent activity, characteristic of 
eukaryotic type I topoisomerases, must be due to fp-topo I. The dif- 
ference in the distribution of topoisomers produced by bacterial and 
human type I topoisomerases is characteristic of the enzymes (Thrash, 
et aL. ibid). 

To demonstrate further that fp-topo I encodes human topo- 
isomerase I, lysates of bacteria expressing fp-topo I were immunopre- 
cipitated with anti-topoisomerase I autoimmune serum. Mg ++ - 
independent topoisomerase I activity was removed from these lysates, 
but not from control lysates treated with normal human serum 
(Figure SB). This data shows that T1A encodes a catalytically active 
portion of human topoisomerase I. 

To investigate whether topoisomerase I is encoded by a single 
gene or a family of genes as has been seen for many other eukaryotic 
proteins such as tubulins (Cleveland, et ah, Annual Review of Bio- 
chemistry . 54:331, 1985), blots of human genomic DNA that had been 
digested with either Eco RL or Hind III were analyzed. A DNA seg- 
ment spanning eodons 484 to 578 (fragment with asterisk in Figure l, 
and see Figure 6 A) was prepared from T1A and used as a hybridization 
probe. The resultant autoradiograph is shown in Figure 4A. Only a 
single DNA fragment was detected in either digest. Since moderate 
conditions of stringency were utilized, only a single locus highly 
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homologous to the cloned topoisomerase I sequence is apparently pre- 
sent in the human genome. 

To determine the size of the mRNA that encodes human topo- 
isomerase I, we probed blots of HeLa cell poly(A) RNA with the 
entirety of TlA. A single RNA species (Figure 4B) of 4100 nucleotides 
was detected. 

C. The Deduced Amino Acid Sequence of Human 
Topoisomerase I. 

The nucleotide sequence of the human topoisomerase I cDNA 
contained in clones TlA and TIB was determined according; to the 
strategy shown in Figure l. The 3645 base TIB sequence begins with a 
putative 5' untranslated region of 211 nucleotides, followed by a 765 
amino acid open reading frame, and terminates with 1139 nucleotides 
of 3' untranslated sequence. The encoded polypeptide has a molecular 
weight of 90,649 Da, and a calculated isoelectric point of 10.05 (26% 
basic and 18% acidic residues). The deduced molecular weight is 
somewhat smaller than the 100,0(30 Da estimate for authentic topo- 
isomerase I determined by NaDodSO,j~PAGE(Liu, et air Proceedings of 
the National Academy of Sciences. U.S.A. . 78:3487, 1981), however, 
apparent molecular weights may differ substantially from actual 
molecular weights (Earnshaw, et al, ibid ). 

Since the corresponding mRNA is 4100 bases in length, it is 
estimated that clone TIB is 250-350 amino acids short of full length 
assuming a poly(A) tail of 100-200. The missing sequence may be 
either 5' or 3' to the open reading frame. Since the 3' end of the 
clones is bounded by a natural Eco RI site, the true length of the 3* 
non translated region is unknown. 

It is not known whether the cDNA sequence presented in 
Figure 5 represents the entire protein coding region. Although the 
unambiguous identification of initiation methionines from cDNA 
sequences is problematic, the specified initiator codon appears to be 
correct for three reasons. 1) The first AUG in the clone is in frame 
with a translation unit that encodes a protein of approximately the 
same size as authentic human topoisomerase I. 2) This AUG codon is 
immediately preceded by nucleotides that have been associated with 
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eukaryotic start codons (Kozak, Cell . 44:283, 1986): a.purine is in posi- 
tion -3, and cytosines are in positons -1, -4 and -5. 3) The 211 bp 
sequence upstream of the putative start codon is 73% GC rich and is 
not deficient in CpG (the 211 bp sequence downstream of the putative 
start codon is only 40% GC rich). Such a sequence arrangement is 
characteristic of the 5' ends of many genes that do not display tissue 
specific expression (Bird, Nature . 321:209, 1986). Despite this, it 
should be noted that an open reading frame extends from the pre- 
sumptive initiation codon all the way to the 5 1 end of the clone. 
EXAMPLE 3 

DETECTION OF AUTOANTIBODIES TO TOPOISOMERASE I 
IN CLINICAL SPECIMENS 

Sera from normal individuals and from thirteen patients 
suspected of having autoantibodies to human topoisomerase I were 
tested using the Western blot technique (Earnshaw, et al., Journal of 
Clinical Investigation , 72:425, 1986) and the polypeptides expressed by 
the cDNA sequence of the invention. The thirteen sera from patients 
suspected of having autoantibodies to human topoisomerase I had pre- 
viously shown a positive reaction for the presence of such 
autoantiodies when tested in a commercially available diagnostic test 
which utilizes a crude extract from mammalian nuclei. When these 
same thirteen sera were tested against the polypeptides of the inven- 
tion only eleven of the patient's sera were positive. Since the West- 
ern technique is more sensitive than the technique utilized in the 
commercial test, two of these patients may have been false positive. 
Since the commercial test contains other nuclear antigens in addition 
to topoisomerase I such as, for example, the centromere, these 
patients sere may have reacted to one or more of these other non-to- 
poisomerase I antigens in the commercial test. None of the normal 
sera which were tested reacted with the polypeptides of the 
invention. 

The invention now being fully described, it will be apparent to 
. one of ordinary skill in the art that many changes and modifications 
can be made thereto without departing form the spirit or scope of the 
invention. 
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CLAIMS • • 

1. A cDNA sequence consisting essentially of a cDNA 
sequence encoding human topoisomerase I. 

2. A host cell transformed or transfected with the cDNA 
sequence of claim 1. 

3. A biologically functional plasmid or viral DNA vector 
including the cDNA sequence of claim 1. 

4. A host cell stably transformed or transfected with a 
DNA vector according to claim 3. 

5. A cDNA sequence consisting essentially of a cDNA se- 
quence encoding a polypeptide having an amino acid sequence 
possessing at least one epitope for autoantibodies to eukaryotic topo- 
isomerase I. 

6. The cDNA sequence according to claim 5 and including 
the protein coding region set forth in Figure 5. 

7. The cDNA sequence of claim 5, wherein said cDNA 
sequence codes for the expression of human topoisomerase I. 

8. A host cell transformed or transfected with a the cDNA 
sequence of claim 5. 

9. A method of providing eukaryotic topoisomerase I 
polypeptide having at least one epitope for autoantibodies to eukar- 
yotic topoisomerase I comprising: 

(a) providing a host cell which replicates and expresses 
an intron-free DNA sequence of eukaryotic topoisomerase I polypep- 
tide; 

(b) growing said host cell; and 

(c) recovering said eukaryotic topoisomerase I polypep- 
tide. 

10. The method of claim 9, wherein said eukaryotic topo- 
isomerase I is human topoisomerase I. 

11. A method of detecting autoantibodies to eukaryotic to- 
poisomerase I which comprises: 

(a) contacting a sample with a polypeptide encoded by 
the cDNA sequence of claim 1; 
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(b) incubating the components of step (a) for a period of 
time and under conditions sufficient for said autoantibodies to bind to 
said polypeptide; 

(c) separating said autoantibodies bound to said 
polypeptide from said sample; and 

(d) detecting the presence of said autoantibodies bound 
to said polypeptide. 

12. The method of claim 11 wherein said eukaryotic topo- 
isomerase I is human topoisomerase I. 

13. The method of claim 11 wherein said sample is from a 
human. 

14. The method of claim 11 wherein said detecting utilizes a 
detectably labeled protein capable of binding to said autoantibody. 

15. The method of claim 14 wherein said binding protein is a 
detectably labeled second antibody. 

16. The method of claim 15 wherein said detectable label is 
selected from the group consisting of a radioisotope, a fluorescent 
compound, a colloidal metal, a chemiluminescent compound, a 
biQluminescent compound and an enzyme. 
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